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ABSTRACT 

The inverted density is one whose contour lines are 
spheroidal as in the normal distribution, but whose moments differ 
from those of the normal in that its conditional arrays are not 
homo sceda Stic, being quadratic functions of the values of the linear 
regression functions* It is also plat^xirtic, its measure of kurtosis 
ranging from that of the normal to that of the uniform depending on 
the valtie of a parameter: as that parameter increases the inverted 
student distribution approaches normality. Measures of kurtosis are 
given for distributions of scores on a number of cognitive tests, and 
they are almost all seen to be platykurtic. Data are presented 
showing that a quadratic terms contributes substantially to the 
regression of conditional variances on test scores in a bivariate 
distribution. These data suggest that the inverted student 
distribution may provide a better description of distributions of 
test scores than does the normal. (Author) 
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THE INVERTED STUDENT DENSITY AND TEST SCORES 



Robert F. Boldt 
Educational Testing Service 

Abstract 

The inverted student density is one whose contour lines are spheroidal 
as in the normal distribution, but whose moments differ from those of the 
normal in that its conditional arrays are not homoscedastic, being quadratic 
functions of the values of the linear regression functions. It is also 
platykurtic, its measure of kurtosis ranging from that of the nomal to that 
of the uniform depending on the value of a parameter: as that parameter 
increases the inverted student distribution approaches normality . Measures 
of kurtosis are given for distributions of scores on a niomber of cognitive 
tests, and they are almost all seen to be platykurtic. Data are presented 
showing that a quadratic tern contributes substantially to the regression 
of conditional variances on test scores in a bivariate distribution. These 
data suggest that the inv-^rted student distribution may provide a better 
description of distributions of test scores than does the normal. 



THE INVERTED STUDENT DENSITY MD TEST SCORES 

Robert F. Boldt 
,^l.ducaiional Testing Service 

Because of difficulties in handling certain problems in multivariate 
statistics, a polynomial distribution similar to the normal seemed to the 
author to be a promising alternative distribution (Boldt, 1962), However, 
the use of the distribution in analysis proved tedious, and the notion was 
dropped. More recently there has been renewed interest in the distribution, 
known as the Inverted Student Distribution, because of findings indicating 
that variances of conditional test score distributions are, in some cases 
at least, not homoscedastic . Rather, the conditional variances seem to 
decrease at extreme values of the independent variables (Boldt, I968, 1972). 
Also, the platykurtic character of test score distributions has come to the 
author's attention. Both of these characteristics are in contrast to the 
normal distribution and are, indeed, properties of the Inverted Student 
Distribution (Press, 1972;' Raiffa & Schlaifer, I961) as will be shown below. 
Thus the interest in the distribution, which was originally based on its 
possible value as an approximation to the normal, may arise specifically 
because of the characteristics it has which are in distinction to the normal 
but which characterize test scores. 

In the material that follows the distribution is introduced, its 
marginal distributions and the constant which makes the distribution a 
density function are developed. Following this, the relationship of certain 
parameters to the population means and to the conditional ireans is given. 
Then the relationship between certain other parameters and the population 
variances and covariances is given, followed by a development of the 
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conditional variances which exhibits the nonhomoscedastic character of 
the conditional variances. SuLsequently it is shown that the distribution 
tends to normality under certain conditions, followed by a section con- 
cerning higher moments of the univariate marginal distribution. Finally, 
data are presented which show that measures of kurtosis of test data and 
the conditional variances of some tests have properties in agreement with 
those of the distribution. 
Let 

X be a column vector of P random variables; 
N , K , C be constants of the distribution; 
y be a column vector of P parameters; 

A be a P x P symmetric positive definite matrix of parameters. 
Then the density is 

C[K - (X - y)'A(X - M)f (l) 

if the quantity in brackets is nonnegative, zero otherwise. Transform X 
to Y as follows: > 

X = I'^C TY + y (2) 

where 

T'AT = I . 

Then the joint distributior of the Y 's is 
P 

c ^(1-Y'r)^ . (3) 

I. i2 
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Note in (3) that the Y 's are symmetrically distributed and hence have 

mean zero. Using (2) it follows that y is the mean of X . To find C , 
2 

define = > ^^en is the ith element of Y . Then the V 's 

have the Dirichlet distribution. 



Equating the constant of (it) to the constant of the Dirichlet distribution 
(Press, 1972, p. 131) yields 

r(N+^) ,i 

C = V • (5) 

r(N + 1) K ^ 

The interpretation of A and K can be developed as follows. The expected 

value of a is ^ p ^ ^ (Press, 1972, p. 13^) and is the expected 

2 

value of a corresponding Y^ . It can be seen on examination of (3) that 
the covariance of Y 's is zero so that the variance-covariance matrix of 
Y 's is (2N + P + 2)""^I . Then 

E(X - y)(X - y)' = A T[E(YY')]T' v4c 

KTT' 



2N + P + 2 ' 



where E is the expectation operator. But since T'AT = I , it follows 
that (TT')-"^ = A or 

E(X - p)(X - y)^ =^jPp|^A"^ . (6) 

One vould not want the variance-covariance matrix of X to depend on the 
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choice of N and P , and hence we choose K = 2N + P + 2 and note that 
A Is the inverse of the variance-covariance matrix of X 's as in the 
normal distribution. 



Marginal Distributions 

Suppose the variables in X and y are partitioned into subvectors 

that 

X = llx^lx^ll , V - llp^ly^ll > 
and A is partitioned as follows 



A = 









^11 


^12 


1 


A' 




1 ' 


^12 


^22 





where the sizes of the submatrices of A are consistent with the partition- 
ing of X . Then, using the transformation 

X^ - = U - \\\2^^2 " ^2^ ' ^'^^ 
the joint distribution of U and X^ becomes 

C[K - U'A^^U - (X^ - '^^22 " ^12^11^12^ ^^2 ^ ^2^^^ * 

Suppose A ^ is partitioned in a fashion corresponding to the partitioning 
of A given above (this is by (6) a partitioning of the variance-covariance 
matrix of X ), yielding say B^^ , B^^ , B^^ with the subscripts corres- 
ponding to sections of the partitioning of A , then it can be shown that 

A - A' A~'^A = B""^ 
^22 ^12 iri2 22 • 

Transforming U to Y as follows: 
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where T'A^^T = I and using (5) yields 



r(N.^) [K- (X,. .,)'B^j(X3-.,)] 2 _ 

r(N + 1) TT^ K 2 |A^J^ 

when there are Q variables Y , (e.g., A^^ and B^^ are Qx Q ; A^^ 
and Q ^ (P - Q) )• "^hen using the transformation 

2 

V. = YT 
1 1 

and the Dirichlet constant yields 

r(N + ^) . 2 J, 2 

as the marginal distribution of the remaining variables. Expression (lO) 
is written with B^g using the relation 

|a| = 1^1^22 ^ ^12^11X2! (Graybill, I96I, pp. 8-9)- 

From (10) it can be seen that the marginal distributions are of the same 
form as (l), i.e., a quadratic form subtracted from a constant and raised 
to power. Thus any distribution for which the form (l) is assumed might 
in some contexts be regarded as a marginal distribution. 



Conditional Means and Variances 

From the expression (8) it can be seen that U is symmetrically 
distributed for any value of and hence has mean zero for that value 

of . Therefore, from (7), the mean of X^ given X^ is 

\ - ^n^i2^^2 - ^2^ • 

It can be shown that -A^^A^2 ^12^22 ^"^^ ^1 ^12^22^ ^2 " ^2^ 

the mean of given X^ . This is the familiar regression formula. 

Conditional variances are not homoscedastic in this distribution. 
That this is true can be seen as follows. Note that if a multivariate 
distribution function is CF , where C is the density constant and U 
is one of its arguments with mean zero, then the conditional variance of 
U is 

/ CF / F 



u 



(11) 



r r 

/CF / F 

In the present distribution, F is of the form in brackets in (8) and 
we are taking U and A^^ as scalars, i.e., Q equals unity. Then if 
we use transformation 

\}\^ = V(K - (X^ - P2^^22^^2 ^ ^2^^ ' ^^2) 

the quantity in parentheses in (12) above by which V is multiplied, 

factors out of the brackets in (8) and, including the differential, is 

3 1 
raised to the N + ^ power in the numerator of (ll), and to the N + — 



power in the denominator. The conditional variance is of the form 



and W turns out to be [A^-j^(K - P + l)]" . 

Tendency Toward Normality 

In this section it is shown that all marginals of the distribution 
approach normality as N approaches infinity. To do this factor K , 
which was defined as equal to 2N + P + 2 , out of the brackets in (l) 
and use the expression for C in (5) to get 



where Q is the quadratic form in the brackets of (l). Using Stirling's 
approximation in the quantity in braces above and letting N «> 
yields 



which is the form of the multivariate normal distribution function. 
Hif^her Moments 

From expression (lO), taKing Q = P - i obtain the univariate marginal 
distribution as 



W[K - (Xg - M2)'222(X2 " ^2^^ ' 



(13) 





P 
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Then 

V- {x-vf 

has the Dirichlet distribution. 



r(N . 4 N.^-|i 

^ V '^[l - V] ^ 



r/n r(N + 

The rth moment of V about the origin, which is the 2rth moment of 

is (Press, 1972, p. 133) 

r(r + i)r(N + ^-^-) 
^ r(N + + r) 



Moments of V can be used to get the even moments of X , and from the 
symmetry, all odd moments are zero. 



E(V) -~ E 1 ^^-^^ 



2 r(i) N + ~ 2N + P + 2 



2 2 

Since K = 2N + P + 2 , it follows that E(x - y) = a is in agreement 
with an earlier result. 

For kurtosis take r = 2 , to get 

P + 2j 

E'-^"" ^ ^ 3.1 



L Ko^ j 'r(i) r(».H^* 



,-2 2 ,„,P*2^i)(„,?4l,2) 



and 



2N -f P -f 2 
2N + P «■ U 



(15) 



Note that lim = 3 as in the normal distribution. Further, since li, is 



monotonic in 2N + P , it is at a minimum when the expon'Jnt in (l^) equals 

zero, i.e., 2N + P - 1 equals zero. Kence, a minimum for Uj^ is 
3 

y X 3 = 1.8 , the kurtosis of a rectangular distribution. 

At the outset it was mentioned that this polynomial distribution 
function fits test data, at least in some respects, better than the normal. 
That this is true at least with respect to kurtosis can be seen by exami- 
nation of Table 1 (for which the author is indebted to F. Swinefcrd). The 
first data line in Table 1 indicates that for a certain form of SAT-V, 
which is scored R-W/J4, the skewness and kurtosis measures based on 900 
cases wer. .3^17 and 2.5155 respectively. Further, by looking down the 
column headed Kurtosis, one can see that in almost all cases the measure 
of kurtosis, which is the ratio of the fourth moment around the mean to 
the squared variance, lies between 1.8 and 3.0 as indicated in equation 
(15) and the following discussion. If measures of kurtosis were symmetri- 
cally distributed around three, thi^ probability of observing three or fewer 
kurtosis measures in excess of three is less than ton to the minus eight for 
thirty-two such measures. Clearly, three is certainly too large a number 
to be the median of the kurtosis measures in Table 1. Those three f'>r 
which the normal might be better, on the basis of the kurtosis measures^ 
are the most skewed and hence still might not be very well described by 
the normal distribution. In most cases, the polynomial distribution seems 
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more appropriate than the normal on the basis of the measure of kurtosis, and 
one may note that this holds across test programs (SAT, GRE, LSAT, ATGSB, 
and SSAT) and hence over different scoring formiilae. 

Insert Table 1 about here 

Table 2 contains data similar xo those in Table 1. However, the data 
in Table 1 come from test programs in which many parallel tests have been 
constructed over the years for essentially similar populations, but those 
in Table 2 (lor which the author is indebted to F, Lord) were constructed 
in a somewhat less familiar situation. Also, though the examinees for the 
testing program whose data are presented in Table 2 were volunteers, they 
did include some 80 to 90% of the students in the grades listed who were 
selected on a statewide basis, and who certainly extend the grade range of 
students beyond that provided in Table 1. In short, no attempt is made to 
choose data which are selective against the normal distribution. 

Insert Table 2 about here 

Another, and perhaps more important, kind of agreement between the 
polynomial distribution and data is in the matter of homoscedasticity . 
Table 3 contains data which indicate agreement between the polynomial 
distribution and observation. This table contains two types of evidence. 
The entries for the table come from scatterplots of reported SAT scores 
and shorter tests which are built to the same specifications except for 
length. The data in Table 3 concern reported score variances observed at 
each level of the shorter tests. The first column in Table 3 gives the 



correlations of the variances with the associated short test score level, 
and the second coliann shows the multiple correlation of the variances with 
the score level and its square. Note that in some cases the gain by using 
the quadratic term is substantial, but of course the correlation with two 
independent variables must be better than with one. Therefore, the data 
in the last two columns are of special interest. These data give the 
standard score regression coefficients, and the important thing to note is 
that in every case the- pattern of signs is identical, being negative for 
the quadratic term. The probability that all thirty-one examples would have 
the same sign is two to the minus thirtieth, and thus the trend noted seems 
quite reliable. In fact, the negative sign for the coefficient of the 
quadratic term is predictable on the basis of the polynomial distrioution 
since in the discussion following the expression (8) it is pointed out that 
the coefficients of variables remaining after integration are the elements 
of the inverse of a matrix B^^ which is a principal submatrix of a 
population variance-covariance matrix that will be positive definite. 
Therefore, the quadratic form in (13) must be positive definite, and it 
follows that the coefficient of the squared term is positive in the quadratic 
form or negative in (13). Thus, one would expect the signs of the quadratic 
terms in Table 3 to be negative, which they were. Under a null hypothesis 
of p = .5 (either sign is equally likely) the probability of all signs 
being negative is only two to the minus thirty-one (as opposed to the two to 
the minus thirty mentioned above). 



Insert Table 3 about here 
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Thus, in two ways the data observed give better agreement with the 
polynomial distribution than with the normal. Of course, it is recognized 
that one must not assmne that because the polynomial distribution is 
better in some respects it is better in all respects. There may be other 
deductions for which the normal is better. However, the agreement of the 
polynomial distribution with the lack of homoscedasticity seems fairly 
important in some applications, particularly in their relation to range 
restriction formulae. 
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Table 1 

Skewness and Kurtosis of Various Operational Test Score Distributions 



Scoring 



Test 


Formula 


No. Cases 


Kurtosis 


Skewness 




R-W/U 


900 


2.5155 


.31*17 






865 


2.721*1* 


.1*210 






900 


2.6078 


.1360 






23^*5 


2.5168 


.21*92 






C.\J\J\J 


c. • D^uy 




n 

SAT-IT 


R-W/li 


900 


2.3701 


.231*2 








P L7Q0 


Plli6 






900 


2.3071* 


.0700 






231*5 


2.5009 


.021*5 






2nnn 


2 S2?fi 




GRE*-V 


R-W/U 


1660 


2 "^86? 


- 021 2 






1U85 


2.1*395 


-.0299 






700 


2 1| 566 










2.1*255 


-.0788 


GRE-Q® 


R-W/I4 


1660 


2. 3151* 


-.0671* 






_L ^ vj y 


? ?7li? 


— .J. I J.p 






700 


2.1*956 


-.121*1* 






11*95 


2.1*627 


-.0863 


f 


i\ 




<i . 0 ( uu 








1810 


3.1*599 


-.5097 


LSAT-Sentence Correction 


R 


350 


2.8873 


-.2638 






1810 


2.9505 


-.231*1* 


LSAT-Reading Comprehension 


R 


1810 


2.5161 


-.1293 


LSAT-Data Interpretation 


R 


1810 


2.8832 


.01*81 


LSAT-Reading Recall 


R 


1810 


2.8627 


-.2669 


LSAT-Principles and Cases 


R 


1810 


3.5500 


-.1*215 


ATGSB^-Reading Recall 


R-W/li 


1090 


2.7166 


-.3351* 






1260 


3.11*75 


-.1*912 






1260 


2.8303 


-.3187 


ATGSB-M 


R-W/li 


1090 


2.5628 


-.11*08 






1260 


2.6103 


.2223 






1260 


2.591*3 


.111*7 
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Table 1 (Continued) 





Test 


Scoring 
Formula 


No. Cases 


Kurtosis 


Skewness 


ATGSB-V 




R-W/4 


1090 
1260 
1260 


2.1*796 
2.1*081* 
2.1*298 


.1597 
.0629 
.0763 


SSAT^-V 




R-W/li 


860 
1820 


2.1*076 
2.1*063 


.1659 
.01*53 


SSAT-M 




R-V/k 


860 
1820 


2.7983 
2.7536 


.1*039 
.1*367 


SSAT-Reading Comprehension 




860 
1820 


2.7667 
2.390I* 


.1*1*91* 
.1878 



Scholastic Aptitude Test (SAT). 
Verbal (V) . 
Mathematics (M) . 

d 

Graduate Record Examination (GRE). 

Quantitative, 
f 

Law School Aptitude Test. 

g 

Admissions Test f-^r Graduate School in Business. 
Secondary School Aptitude Test. 
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Table 2 

Survey of Students in a Midwestern State, September 1965 
Kurtosis and Skewness (in Parentheses) of Test Score Distributions* 

Test and 



No. of Items 


h 


6 


8 


10 


English Achievement 


2.2QU 


2.586 


2.372 




20 items 


(-.113) 


(-.092) 


(-.016) 




Math Ability 


2.336 


2.I181 


2.1*26 


2.1*38 


50 items 


(.079) 


(.018) 


(.031*) 


(-.011) 


Math Achievement 


2.611 


2.900 


3.001 


2.861* 


50 items 


(.132) 


(.138) 


(.276) 


( .200) 


Reading 


2.629 


5.837 


2.893 




30 items 




(-1.955) 


(-.287) 




Verbal Ability 


2.975 


2.1*91 


2.238 


2.1*26 


50 items 


(.39M 


(.112) 


(.087) 


(-.001) 


^0 entry is based 


on less than 


19,500 cases. 


all volunteers. 
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Table 3 

Correlation between Linear and Quadratic Functions of Equating Test Scores 
and the Conditional Variances and Standard Score Regression 
Coefficient for the Quadratic Function 



Correlations Regression Terms 

Test 



SAT-M 



SAT-V 



Linear 


Quadratic 


Linear 


Quadratic 


.3671 


.6759 


1.5920 


-2.0396 


.0100 


.8823 


2.1*882 


-2.61*91* 


.3311 


.8950 


2.7800 


-2.5862 


.3009 


.5079 


.8909 


-1.2601 


.0158 


.1810 


.1*856 


-.5033 


.0126 


.8181^ 


2.1283 


-2.2919 


.0278 


.9179 


2.7297 


-2.8525 


.0322 


.8518 


2.2592 


-2.381*2 


.2287 


.9558 


3.2376 


-3.5881* 


.2563 


.7507 


2.8917 


-2.7232 


.1*807 


.7687 


1.7599 


-2.3196 


.3861* 


.7027 


1.8381 


-2.3005 


.3037 


.1*656 


1.011*3 


-1.361*5 


.1179 


.7061 


2.1*821* 


-2.6919 


.5800 


.7331* 


1.0378 


-1.6850 


.6531 


.8880 


1.59^*5 


-2.3267 


.2615 


.5233 


1.2869 


-1.6131* 


.OI169 


.731*0 


2.35^*9 


-2.5110 


.181*9 


.1*899 


1.1059 


-1.3632 


.0985 


.8230 


2.1*892 


-2.5350 


.0521 


.8230 


2.1*892 


-2.5350 


.0521 


.7811* 


2.295I* 


-2.1*736 


.3161 


.7213 


1.6617 


-2.0813 


.1708 


.7695 


2.1*960 


-2.1*1*52 


.3795 


.7766 


2.1826 


-2.6801 


.2380 


.3280 


.6155 


-.8828 


.1*377 


.771*1* 


1.9791 


-2.1*998 


.1*193 


.711*2 


1.7678 


-2.2619 


.2791* 


.i*157 


.8013 


-I.I36I* 


.0531* 


.6771 


2.5506 


-2.6901 


.0978 


.1020 


.2072 


-.1132 


.0170 


.0665 


.2598 


-.2511 
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